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"(N. 

Abstract. A graph G is called a replication graph of a graph H if G is 
obtained from H by replacing vertices of H by arbitrary cliques of vertices 
and then replacing each edge in H by all the edges between corresponding 
' cligues. For a given graph H the pn(H) is the minimal number of vertices of a 

replication graph G of H such that every proper vertex coloring of G contains 
a rainbow induced subgraph isomorphic to H having exactly one vertex in 
each replication clique of G. We prove some bounds for pn for some classes 
■J ' of graphs and compute some exact values. Also some experimental results 

obtained by a computer search are presented and conjectures based on them 
are formulated. 



c/2 . 1. Introduction 

° 

Rainbow induced subgraphs have been considered in many papers in connection 
with various problems of extremal graph theory. They have been considered both 
■ for edge-colorings and vertex-colorings, and both in terms of existence or in terms 

of avoiding (see [U O [3j EJ El [17j [18] ) . Our special motivation comes from research 
in on-line coloring (see 0[T4]), where the base for some algorithms is the existence 
of rainbow anticliques to force a player to use a new color. In particular, in |14j . 
a problem has been formulated to estimate the minimal number of moves in the 
game considered one needs to force the appearance of a rainbow copy of a fixed 
graph H in a fixed class of graphs C. 

This paper is continuation of where the number p{H) being the minimum 
order of a graph G such that every proper vertex coloring of G contains a rainbow 
induced subgraph isomorphic to H was introduced. It turned out that in certain 
situations, especially for paths, more natural and easier to handle is the number 
Pr{H) referring to replication graphs defined as follows. 

Given a graph H and a vertex v £ H, we construct a graph W by adding a new 
vertex v' and edges joing v' with v and all the neighbours of v in H. Aany graph 
G obtained fro H by a series of such constructions is called a replication graph of 
G. Note that vertices in G corresponding to a vertex v S H form a clique. 

If G is a replication graph of H such that in each proper vertex coloring of G 
there exists a rainbow induced subgraph H having exactly one vertex in each 
of the cliques Ki corresponding to a vertex hi in H, then we write G H. By 
pn(H) we denote the minimal number of vertices in any replication graph G of H 

satisfying G H. 

In this paper we provide some bounds for pfj for certain classes of graphs. Fol- 
lowing problems formulated in [llj our main interest is in paths. In addition, the 
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exact value of pr for a double star is computed. We present also some experimental 
results obtained by computer search for small paths and conjectures based on them. 

2. The Hall's type theorem 

We start from the following more general result. 

Theorem 2.1. Let H be a graph and G be its replication graph. The following 
conditions are equivalent: 

• For each subset S of the vertices of H , the chromatic number of the subgraph 
of G induced by the replication cliques corresponding to the vertices from S 
(denoted by G[S\) is at least \S\. 

• g4 h. 

Proof. Let the first condition be satisfied. We take any proper vertex coloring and 
we will show that there is a rainbow H having one vertex in each replication clique. 
We define a bipartite graph consisted of the set of vertices of H and the set of colors 
used in the coloring. A vertex v is connected with all colors used in the replication 
clique obtained from v. Since each subset of k vertices is connected with at least k 
colors then by Hall's theorem we obtain that there exist matching such that each 
vertex is matched with unique color which is connected to it. So the matching 
defines the colors of the vertices in replication cliques and selecting a vertex from 
each replication clique of these colors gives the rainbow induced subgraph H. 

Conversely, if for some subset S the chromatic number of the subgraph is less 
than \S\ then we could get a coloring which uses less than l^l colors for these cliques 
and extend it to the whole graph G obtaining a coloring from which we cannot select 
rainbow vertices of H from each replication clique. □ 

Lemma 2.2 ([19]). A replication of a perfect graph is perfect. 

Thus for perfect graphs we could consider the maximal clique instead of the 
chromatic number of the subgraph induced by a subset. Paths are perfect and so 
the replication graphs of them. The maximal clique in the replication graph of path 
is always formed by two adjacent replication cliques. 

3. The upper bound for paths 

Theorem 3.1. 




n 2 /4 + n 2 /16 + n/2 if n = mod 4 

n 2 /4 + n 2 /16 + 3n/8 + 5/16 if n = l mod 4 

n 2 /4 + n 2 /16 + n/2 - 1/4 if n = 2 mod 4 

n 2 /4 + n 2 /16 + 3n/8 + l/16 if n = 3 mod 4 



Proof. We will define a suitable G as a sequence of orders of replication cliques of 
P„, and show that each fc-subset of this sequence contains a number at least k or a 
pair of consecutive numbers in the sequence of sum at least k. 
(1) Assume that n = mod 4. 

Define four sequences of length n/4 of numbers: 

Si = (l,2,...,n/4) 
Ti = (n/2,n/2,...,n/2) 
S 2 = (n/4+l,n/4 + 2,...,n/2) 
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T 2 = (n/4, n/4,..., n/4, n/2) 

Then G is denned by alternately taking the numbers from S\ , T\ , S 2 , T 2 
in that order, so the first number is the first of Si, the second is the first 
of Ti, the fifth is the second of Si and so on. 

For example for n = 16: 

51 = (1,2,3,4) 
Ti = (8,8,8,8) 

5 2 = (5,6,7,8) 
T 2 = (4,4,4,8) 

S= (1,5,2,6,3,7,4,8) 

T= (8, 4, 8, 4, 8, 4, 8, 8) 

G= (1,8,5,4,2,8,6,4,3,8,7,4,4,8,8,8) 
It could be seen in this way: 

51 = ( 1 2 3 4 ) 
Ti = ( 8 8 8 8 ) 

5 2 = ( 5 6 7 8 ) 
T 2 = ( 4 4 4 8 ) 



G = (1854286438744888) 

Consider a fc-subset of numbers of G. By a value of the subset we mean 
maximum over numbers from the subset or sums of pairs of consecutive 
numbers in the sequence which are both in the subset. The subset contains 
Si numbers from Si, s 2 numbers from S 2 , ti numbers from Ti and t 2 
numbers from T 2 . 

Assume for the contrary that the value of the subset is less than k. There 
are the following cases: 

(a) If k < n/4 then all numbers in the subset must be from Si, but there 
are only k — 1 numbers less than k, so this is impossible. 

(b) If n/4 < k < n/2 there are not numbers from Ti. 

Consider a set Xi for 1 < i < n/4 — I which contains i-th numbers from 
S 2 and T 2 and (i + l)-th number from Si, but only those which are 
also in the fc-subset. In each Xi there could be at most two numbers 
because there cannot be a pair between S 2 and T 2 . There could be at 
most one number if i > k — n/4 because there cannot be a number 
from S 2 and cannot be a pair between T 2 and Si. In summary there 
are at most 

2 * (k - n/4 - 1) + ((n/4 - 1) - (k - n/4) + 1) = k - 2 
numbers in these X t sets. 

Except the first number 1 of the Si there could be only numbers from 
the sets Xi. So there is at most k — 1 numbers in the subset, which is 
a contradiction. 
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(c) Ifn/2<fc<n/2 + n/4 then we define Xi for each 1 < i < n/4 which 
contains i-th numbers from Si, T\, 52, T 2 but only those which are 
also in the fc-subset. 

Observe that in each Xi there cannot be both numbers from T\ and 
S 2 . Also there cannot be both from Si and T\ or both from S 2 and 
T 2 if i > k — n/2. So for i < k — n/2 there can be at most 3 numbers 
and for i > k — n/2 there can be at most 2 numbers. 
In summary in the Xi sets there could be at most 

3 * (k - n/2 - 1) + 2 * (n/4 - (k - n/2) + 1) = k - 1 

numbers. But every number from the subset would belong to some Xi 
so this is a maximal size of the fc-subset and a contradiction occurs. 

(d) If n/2 + n/4 < k < n then we observe that there could be at most 
k — n/2 — n/4 — 1 pairs between numbers from T\ and S 2 . So the 
maximal size of the fc-subset is 

3 * (n/4) + (k- n/2 - n/4 - 1) = k - 1. 

This a contradiction. 

(2) Assume that n = 1 mod 4. 

We define sequences: 

Si = (l,2,...,(n + 3)/4) of length (n + 3)/4 
Ti = ((n + 3)/4, (n + 3)/4, (n + 3)/4) of length (n - l)/4 
S 2 = ((n + 3)/4 + 1, (n + 3)/4 + 2, (n + l)/2) of length (n - l)/4 
T 2 = ((n + l)/2, (n + l)/2, (n + l)/2, (n + 3)/4) of length (n - l)/4 
And G in similar way as before. 
For example for n = 13: 

51 = ( 1 2 3 
Ti = ( 4 4 4 

5 2 = ( 5 6 7 
T 2 = ( 7 7 4 

G = (145724673474 

(3) Assume that n = 2 mod 4. 

We define sequences: 

Si = (1, 2, (n + 2)/4) of length (n + 2)/4 
Ti = (n/2, n/2, n/2) of length (n + 2)/4 
S 2 = ((n + 2)/4 + 1, (n + 2)/4 + 2, n/2) of length (n - 2)/4 
T 2 = ((n + 2)/4, (n + 2)/4, (n + 2)/4) of length (n - 2)/4 
And G in similar way as before. 
For example for n = 14: 

51 = ( 1 2 3 4 ) 
Ti = ( 7 7 7 7 ) 

5 2 = ( 5 6 7 ) 
T 2 = ( 4 4 4 ) 

G = (17542764377447) 



4 ) 
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(4) Assume that n = 3 mod 4. 
We define sequences: 

Si - (1, 2, (n + l)/4) of length (n + l)/4 

Ti - ((n + l)/4, (n + l)/4, (n + l)/4) of length (n + l)/4 

S 2 = ((n + l)/4 + 1, (n + l)/4 + 2, (n + l)/2) of length (n + l)/4 

T 2 = ((n + l)/2, (n + l)/2, (n + l)/2) of length (n - 3)/4 

And G in similar way as before. 
For example for n = 15: 

51 = ( 1 2 3 4 ) 
Ti = ( 4 4 4 4 ) 

5 2 = ( 5 6 7 8 ) 
T 2 = ( 8 8 8 ) 

G = (145824683478448) 



□ 



4. The lower bound for paths 

We will show that the exact value of pn(P n ) is above the simple bound n 2 /4 by 
an 0(n 2 ) component. 

R 

Lemma 4.1. Let G be a minimal graph such that G — > P n , so the order of G is 
Pr{P„). Let 

e = £min{L4|,|B|}, 

ab 

where each ab is an edge of P n between vertices a and b, A and B are replication 
cliques in G obtained from the vertices a and b. The following holds: 

n(n + 1) 



\G\ = PR {P n ) > 



— e 



Proof. We show that we could obtain from G a graph G' which is a replication 

R 

graph of A n such that G' — > A n by adding exactly e vertices. 

At first for each edge ab in P n let us mark the smaller replication clique A or 
B (or any of them if they are equal). The cliques can be marked marked twice 
(by two edges), once (by one edge) or left unmarked. Note that we have one more 
vertex than the number of edges, so at least one clique is unmarked. If a clique is 
unmarked then its neighbor cliques must be marked. 

For a graph H which is a replication graph of a disjoint union of paths U , such 

that G" -3- U, we define the procedure: Get a clique A which is unmarked and 
which is connected with one or two neighbor cliques. Then increase number of 
vertices in A by the sum of orders of the neighbor cliques, and remove connections 
between A and them. Also we remove one mark from each of the neighbor cliques. 

The obtained graph H' is a replication graph of V which is U without one or 
two edges. There is still an unmarked clique in U' having a neighbor or U' is just 

R 

An. We show that H' A U' . Get any fc-subset of U'. If the subset does not contain 
A then it has the same chromatic number in H' as in H, so it has at least k. If 
the subset contains A then assume that it has a smaller chromatic number than k 
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in H' . With the fact in H the subset has the chromatic number at least k it must 
come from an induced clique between A and one of its neighbor clique. But in H' 
the clique A' is at least as large as the induced clique and so k. 

We use the defined procedure for G and repeat it until we obtain a replication 
graph of A n . For each mark on a clique A we have added exactly \A\ vertices during 
the process, because we have not added vertices to marked cliques. So by a way 
in which we marked the cliques we have added exactly e vertices. The order of the 
result graph must be at least pr(A u ) = so the lemma holds. □ 

R 

Lemma 4.2. Let G be a replication graph of P n such that G —> P n and 2\n. Let 
c,x be any numbers from [0,1/2]. If 

\G\<(n/2)(n/2 + l)+ Cn{cn - 1) 

then the following holds: 

• Let X be a subset of cliques which have orders at most xn. Then 

2xn — cn < \X\ < 2xn. 

• Let X' be a subset of cliques which have orders at least xn. Then 

n — 2xn < \X'\ < n — 2xn + cn. 

• Let X" be a subset of cliques which have orders at least xn and at most yn. 
Then 

\X"\ > 2n(y-x) - cn. 

Proof. Assume that \G\ < (n/2)(n/2 + 1) + c "( c ^-~ 1 ) _ jf we sor t the replication 
cliques ascending by the order and consider i-th replication clique then it must 
have order at least \i/2~\. Otherwise if we get i-subset of the cliques smaller than 
\i/2~] then its the largest induced clique and so the chromatic number will be at 
most {[i/2] - 1) * 2 < i - 1. 

If an i-th replication clique of order k has more than \i/2~] vertices then we say 
that is has k — i/2 extra vertices. So we have exactly (n/2)(n/2 + 1) non-extra 
vertices in G. 

Consider a subset X of cliques which have orders at most xn. The clique of 
order not larger than xn can be at most 2xn-th clique in our order, so \X\ < 2xn. 
If \X\ < 2xn — cn then the i-th cliques where i = 2xn — cn, 2xn — cn + 1, 2xn — 1 
have at least xn — \(2xn — cn)/2] , xn — \(2xn — cn + l)/2] , xn ~ \(2xn — l)/2] 
extra vertices respectively. So they have at least cn/2 — l,cn/2 — 2,...,0 extra 
vertices respectively. In summary we have at least (cn/2 — l)(cn/2)/2 > c " ( -™~ 1 - ) 
extra vertices and it contradicts the assumption, so we are done in the first case. 

Consider a subset X 1 of cliques which have orders at least xn. So \X'\ > n — \X\. 
By our bounds \X'\ > n — 2xn and \X'\ < n — (2xn — cn) = n — 2xn + cn. So we 
are done in the second case. 

Consider a subset X" of cliques which have orders at least xn and at most yn. 
Then \X"\ > \X'\ — \Y'\ where X' contains cliques of orders at least xn and Y' 
contains cliques of orders at least yn. So \X"\ > n — 2xn — (n — 2yn + cn) — 
2n(y — x) — cn and we are done in the third case. □ 

Theorem 4.3. Let G be a replication graph of P n such that G — > P n and 2\n. Let 

a', a, b, c, d be the numbers from [0, 1/2] such that: 
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(1) b < a < a'. 

(2) a + |c+ d < a'. 

(3) a + |c+ d < 26. 
TTie following holds: 

, „, / , , , , , [cn(ca— 1) d(a — b) 9 1 
|G| > (n/2)(n/2 + l) + niiii|— ^ -: A ^™ J" 

Proof. Get a subset M of cliques of orders in [an, a'n]. Let TV be a subset of cliques 
of orders at most bn. Let M/v be a subset of cliques of orders at most bn which are 
also neighbors of some clique from M . 
We could bound e defined in 14. II 

e < Pr(Pu) - \M N \^^n. 

If for each edge we take the order of the clique obtained from the left vertex into e 
then e < \G\. Now for these edges which have the left vertex producing clique from 
M/v and right vertex producing clique from N we could take the order of the right 
clique instead of the left. For each such edge we get the number of vertices smaller 
at least by (a — b)n. At least half of the cliques from Mn have a neighbor from 
M on the left side or on the right side. If it is the first case then we have at least 
| Mm |/2 such edges, and in the second case we could inverse the argument to the 
right side. So we have decreased e from a value less than \G\ at least by \M^\ s ^n 
obtaining the upper bound for e in this way. 
If |Mjv| > dn then 

a — b 

e < PR(Pn) ~ dn—^—n 

and so by 14. II 

,„ x n(n + 1) ,„ , , a — b , 

PR(Pn) > V 2 ^ - (pR(Pn) - dn — n) 

,„n n(n + 1) d(a — b) 
Pr(Pu) > K 4 ' + -^n 2 

and the theorem holds. 

So assume now that \Mn\ < dn. Consider a subset X which consists of the 
cliques from N without the cliques from Mjy and with at least half of the cliques 
from M which are not adjacent in M. 

By 14.21 either the theorem holds or \M\ > 2n(a' — a) — cn and \N\ > 2bn — cn. 
We know \X\ > \N\ - |Mjv| + \M\/2 and so 

s , 2n(a' — a) — cn 
\X | > (2bn - cn) - dn + — i ^ 

|X| > (26 + (a 1 -a)-~c-d)n = Q 

We will show that the subset X has the chromatic number at most Q. At first 
we need to show that cliques from M cannot be larger that Q. We know that a 
maximal clique in M and so in X can have a'n vertices. So we need to show that: 

3 

(26 + (a' — a) — — c — d)n > a'n 

and this is equivalent of 

3 

26 > a + -c + d 
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Figure 1. A double star S(a,b) with anticliques A and B, and 
central vertices C and D. 



which comes from assumption (3). 

Secondary we need to show that X cannot have an induced clique of order 
greater than Q. Because only cliques which come from N — Mm can be connected 
the maximal induced clique here can be of order at most 2bn. So it is sufficient to 
show: 

3 

(26 + (a' -a)--c- d)n > 2bn 



and this is equivalent of 



3 . 
a > a + -c + a 



which comes from assumption (2). 

So the subset X has the chromatic number less at most Q < \X\ which contra- 

R 

diets that G — >• P n , and so the theorem holds. □ 
Corollary 4.4. 

Pr{Pu) > n 2 /4 + n 2 /784 + n/2. 

Proof. Let a' = 1/2, a = 1/4+1/14, b = 1/4, c = 1/14, d = 1/14. The condition (1) 
in[0]is clearly satisfied. Since fc + d = 5/28 the condition (2) 1/4 + 1/14 + 5/28 = 
1/2 < 1/2 is satisfied and also the condition (3)l/4 + l/4 + 5/28=l/2<2*l/4 = 
1/2 is satisfied. So: 



cn(cn — 1) 



7i7784 - 7i/56, 



4 

d( °- h) n 2 = „ 2 /7*4. 
4 

And by 14.31 we have finally 



Pr{Pu) > (n/2)(n/2 + 1) + n 2 /784 - 7i/56. 

□ 

5. The exact value of p R for a double star 

Let S(a, 6) be a double star of order 71 = a + 6 + 2 with two anticliques A of order 
a and B of order 6, and two central vertices C connected with A and D connected 
with B. 
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Our bounds yield: 

(n-2)(n-l) „ , xx (n-2)(n-l) , „ 
1 + 2 < p(S(a, &)) < ± 1 i + a + 6 + 2. 

Theorem 5.1. Assume that b > a > 1. TTie following holds: 

b 



rru i w (n-2)(n-l) 
/9 fl (S(a,6)) = i £ ^ + 



+ 3. 



Proof. First we will show that this number of vertices is sufficient to construct a 
suitable G. We need to define orders of replication cliques of S(a, b). 



Let g = 



. Let A' be the set of replication cliques of vertices of A in G, and 



a+l 

B' be the set of replication cliques of vertices of B in G. Let C be the replication 
clique of vertex C and D 1 be the replication clique of vertex D. Let c be the order 
of C and d be the order of D'. 

We get any order of the vertices {i>i, i>2, • ■ • , Vn-2} of A U B such that the length 
of the longest sequence of consecutive vertices from B is at most g. Each vertex Vi 
is replicated to a clique of order i. So cliques in B' would have orders 1,2,. . .,g,g + 
2, . . . , 2<7+l, 2g + 3, . . . and cliques in A' would have orders g+1, 2g + 2, . . .. Then let 
c = g + 1 and d = 2. We will show that the obtained graph G satisfies G -A S(a, b). 

Get a fc-subset of replication cliques of G. Let L be the set of orders of cliques 
from A' which are in the subset, and let R be the set of orders of cliques from B' 
which are in the subset. We consider four cases depending if C or D' is in the 
subset: 

(1) The subset does not contain C nor D'. So because cliques from A' U B' 
have 1, 2, . . . , n — 2 orders and we have fc of them there is a clique of order 
not less than fc. 

(2) The subset contains C but not D'. Because C is connected with the cliques 
from C there is a clique of order max(L) + g + 1 in the subset. Assume 
for the contrary that k > max(i) + g + 1. So the subset can contain all 
cliques of order not greater than max(L) from A' U B', there are max(L) 
such cliques. Also it contains C . So the remaining cliques must come from 
B' and there must be k — max(L) — 1 such cliques. All of them have orders 
between max(L) + 1 and k — 1 (inclusive) and there is exactly k — max(L) — 1 
such cliques in A' U B'. They all must be in B', but k — max(L) — 1 > g 
and we defined orders in such a way that the longest consecutive sequences 
of cliques' orders from B' has length g. So at least one clique is in A' so 
this is impossible. 

(3) The subset contains D' but not C. The subset contains k — 1 cliques from 
A' U B' and they must be all cliques of orders 1, 2, . . . , fc — 1. Since in our 
construction no two cliques in A' have orders different by 1 at least one of 
two largest cliques of orders k — 2 and k — 1 is in B' . So together with D' 
it forms a clique of order at least k — 2 + 2 = fc. 

(4) The subset contains both C and D' . There are fc — 2 cliques from A' US', so 
there must be a clique of order at least fc— 2 in the subset. So max(L) > fc— 2 
or max(i?) > fc — 2. In both cases there is a clique of order max(L) +g + 1 > 
fc-2 + g + l>fcor max(i?) + 2>fc-2 + 2 = fc. 

So the constructed graph G satisfies the property that for any fc-subset there is 
an induced clique of order at least fc, and so its chromatic number is at least fc. It 
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remains to show that any graph G requires such number of vertices. So let now 

G be any replication graph of S(a,b) such that G — > S(a,b). Observe that G is 
perfect, so each fc-subset must have an induced clique of order at least k. 

We sort orders of cliques from A'liB' and obtain an orders sequence s\ , S2 , ■ ■ ■ , s n -2 
of cliques Si, S2, ■ ■ ■ , SV1-2 where Sj < Sj if i < j. Note that Sj > i. If Sj — i > 
then we say that Si has Si — i extra vertices. Also we say that C has c — 1 extra 
vertices and D has d—1 extra vertices. We will show that G has at least g + 1 
extra vertices in summary. 

By definition of g we know that there exists k > g such that if we get the 
first k cliques in our sequence from A' U B' then the last g of them (which are 
Sk-g+i, • • • , Sk) come from B'. Consider a set of g subsets which has for i: k—g+2 < 
i < k + 1 an i-subset consisting of the clique C and the first i — 1 cliques in our 
sequence from A' U B' . For an i-subset we define L, to be a set of orders of cliques 
which comes from A' and Ri to be a set of orders of cliques which comes from 
B'. So each such i-subset has an induced clique of order at least i and so either 
i < max(L,) + c or i < max(i?i). If Li is empty then we define max(L^) — 0. In an 
i-subset the last i — (k — g + 1) > 1 cliques in our order come from B' , so we have 
max(Lj) < max(iij). 

If for an i-subset there is i < max(i?j) then i < Sj_i. It implies that Si-i from 
S' has at least one extra vertex. We consider two cases: 

(1) If all of these subsets satisfy i < max(iij) then there is an extra vertex in 
each of Sk-g+i, ■ ■ ■ , Sk- So we have at least g extra vertices in B' . 

If c > 2 or d > 2 then we have g + 1 extra vertices in summary. Otherwise 
consider any r-subset (2 < r < n) consisting of the first r — 2 cliques in our 
sequence from A'lJB' and the cliques C and D 1 . The largest clique of order 
at least r must consist of SV_2 and either C or D' , so s r _ 2 + 1 > r. So 
5 r _2 has at least one extra vertex. Because this holds for any such r-subset 
any clique from A' U B' has an extra vertex, and because a + b > g we have 
at least g + 1 extra vertices. 

(2) If for one of these subsets i > max(iij) then it must satisfy i < max(Li) +c. 
Consider the largest i-subset of them. So we have k — i + 1 extra vertices 
in Si, . . . , Sk from B' implied by the subsets larger than i. 

If Li is empty then we have i < c and so C has i — 1 extra vertices. 
Together with the fc — i + 1 extra vertices from B 1 we have k > g extra 
vertices in summary in B' and C. 

If Li is non-empty then let max(Li) = Sj. 5^ G A' and so j < fc — g 
because the cliques Sk- g +i, ■ ■ ■ , Sk are in B' . Having i < Sj + c we could 
write i < j + ej + c where is the number of extra vertices in Sj . From 
these we obtain i < (k — g) + ej + c and so ej + c — 1 > i — k + g — 1. 
Obtained e^- + c — 1 is the number of extra vertices in both Sj G A' and 
C". If we add the k — i + 1 extra vertices from B' then we have at least 
(i — k + g — 1) + (k — i + 1) = g extra vertices in summary in A' U B' and 

a. 

If d > 2 then we are done, so assume that d = 1. 

If we add £>' to the i-subset from this case and obtain an (i + l)-subset 
then it must have an induced clique of order at least i + 1. So we have 
the three following subcases depending on what the clique of order at least 
i + 1 consist of: 
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(a) If the clique consists of D' and some clique from B' then i + 1 = 
max(Ri) + 1 and we have a contradiction as we assumed i > max(i?i). 

(b) If the clique consists of C and some clique from A 1 (or just C if Li 
is empty) then i + 1 = max(Li) + c and we could go similar as in the 
case: 

If Li is empty then i + 1 = c and the i extra vertices in C together 
with the k — i + 1 extra vertices from £?' give fc + 1 > g + 1 extra 
vertices in summary. 

If Li is non-empty then we could write i + 1 < Sj + c obtaining (i + 
1 — k + g — l) + (k — i + = .9 + 1 extra vertices in summary in A' U B' 
and C. 

(c) If the clique consists of C and D' then i + 1 < c + 1 and so i < c. 

If Li is non-empty then i + 1 < max(Li) + c and we could follow the 
previous subcase. 

Assume that Li is empty. C has i — 1 extra vertices and together 
with the k — i + 1 extra vertices from B' we have k extra vertices in 
summary in C and 

If fc > g then we are done, so assume that k — g. Note that we have 
counted extra vertices from B' which are only in the first g cliques in 
our sequence from A'liB', also we have counted only one extra vertex 
in one clique of B' . 

If a = 1 and 6 = 1 then 5(1, 1) is P4 and the theorem is true. Otherwise 
there are at least 5 + 2 cliques in A' U B' . We will show that there is 
an extra vertex in the cliques 5„_3 or S n -2 which are not in the first 
g cliques in our sequence, or in some clique in A' U B' there are two 
extra vertices. 

Get a n-subset consisting of all cliques. There are three sub-subcases 
depending where is the clique of order at least n. 

• If c + 1 > n then c has n — 2 > g extra vertices. 

• If s„_2 + 1 > n (when 5„_2 S B') then 5„_2 has one extra 
vertex. 

• If s n -2 + c > n (when 5„_2 £ A') then we remove C from the 
subset and obtain either s„_ 2 > n — 1 or Sj + 1 > n — 1 for some 
j < n — 3 (Sj E B'). In the first case we are done. In the second 
case we have Sj > n — 2 and so Sj E B' has at least one extra 
vertex if j = n — 3 or has at least two extra vertices if j < n — 3. 

So in all cases we have at least g + 1 extra vertices and the theorem holds. 

□ 



6. Experiments 

Lemma 6.1. For a given graph G which is a replication graph of P n , the problem 
of verifying if G — >• P n can be solved in time 0(n 2 ) and memory 0(n). 

By computer search, we have found all minimal order replication graphs for each 
path up to 16 vertices and so we have the exact value of pn(P n ) for n < 16. These 
shows that the upper bound is tend to be very close to the exact value, especially 
we have the conjectures: 
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Conjecture 6.2. For odd n > 1 the upper bound is tight: 

n 2 /4 + n 2 /16 + 3n/8 + 5/16 if n = 1 mod 4 



PR(Pn) | n 2 /4 + n 2 /16 + 3n/8+ 1/16 if n = 3 mod 4 
Conjecture 6.3. For any n > 1 f/iere is Pr(P„) = n 2 /4 + n 2 /16 + O(n). 

Here are the exact values, the number of minimal order replication graphs and 
representations of some of them. 
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Pr{Ps) = W(5graphs) 

1) 1 2 2 2 3 

2) 1 2 3 2 2 

3) 2 1 3 2 2 

4) 2 2 1 2 3 

5) 3 1 1 2 3 

pB.(Pe>) = U{Ugraphs) 
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Pb.(P7) = 18(3graphs) 

1) 1234224 

2) 1322433 

3) 2332134 

Pij(P s ) = 23(l 3 rapft) 
1) 14422433 



Pr{P 9 ) = 29(20graphs) 
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Pr(Piq) = 35(3graphs) 

1) 1345235633 

2) 1542255335 

3) 5325431246 



Pr(Pu) = 42(16graphs) 
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pB.(Pi2) = 49(lgraph) 
1) 165236633644 

Pr.(Pis) = 58(116graphs) 

Pr(Pu) = 66(3graphs) 

1) 14572467347844 

2) 17532763377447 

3) 17623663377447 
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We have observed also an interesting property about the cycles and state the 
following conjecture: 

Conjecture 6.4. p R (C n ) = PR.{P n ) for n>6. 

We note that pr{C^) = 6, p{C^) = 9. The conjecture was checked for n < 12. 
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